Abstract. Let $\eta_i$, $i = 1, \dots, n$, be iid Bernoulli random variables, taking
values $\pm 1$ with probability $1/2$. Given a multiset $V$ of $n$ elements
$v_1, \dots, v_n$ of an additive group $G$, we define the concentration probability
of $V$ as

\[ \rho(V) := \sup_{v \in G} P(\eta_1 v_1 + \dots + \eta_n v_n = v). \]

An old result of Erdős and Moser asserts that if the $v_i$ are distinct real
numbers then $\rho(V)$ is $O(n^{-3/2} \log n)$. This bound was then refined by
Sárközy and Szemerédi to $O(n^{-3/2})$, which is sharp up to a constant
factor. The ultimate result is due to Stanley, who used tools from algebraic
geometry to give a complete description of sets having optimal
concentration probability; the result has since become a classic in algebraic
combinatorics.

In this paper, we will prove that the optimal sets from Stanley's 
work are stable. More importantly, our result gives an almost complete 
description for sets having large concentration probability. 



1. Introduction 

Let $\eta_i$, $i = 1, \dots, n$, be iid Bernoulli random variables, taking values $\pm 1$ with
probability $1/2$. Given a multiset $V$ of $n$ elements $v_1, \dots, v_n$ of an additive
group $G$, we define the random walk $S$ with steps in $V$ to be the random
variable $S := \sum_{i=1}^n \eta_i v_i$. The concentration probability of $V$ is defined to be

\[ \rho(V) := \sup_{v \in G} P(S = v). \]

Motivated by their study of random polynomials, in the 1940s Littlewood
and Offord [5] raised the question of bounding $\rho(V)$ where the $v_i$ are nonzero real
numbers. They showed that $\rho(V) = O(n^{-1/2} \log n)$. Shortly after the
Littlewood-Offord paper, Erdős [1] gave a beautiful combinatorial proof of the
refinement

\[ \rho(V) \le \binom{n}{\lfloor n/2 \rfloor} / 2^n = O(n^{-1/2}). \tag{1} \]
Erdős' result is sharp, as demonstrated by $V = \{1, \dots, 1\}$.
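
For small integer multisets, the concentration probability can be computed exactly by counting sign patterns. The following Python sketch is only an illustration (the function names are ours, not taken from the paper); it computes $\rho(V)$ by dynamic programming over partial sums and compares it with Erdős' bound $\binom{n}{\lfloor n/2 \rfloor}/2^n$.

```python
from collections import Counter
from fractions import Fraction
from math import comb


def sum_distribution(steps):
    """Map each value of S = sum(eta_i * v_i) to the number of sign
    patterns (eta_1, ..., eta_n) in {-1, +1}^n that produce it."""
    counts = Counter({0: 1})
    for v in steps:
        nxt = Counter()
        for s, c in counts.items():
            nxt[s + v] += c
            nxt[s - v] += c
        counts = nxt
    return counts


def concentration_probability(steps):
    """rho(V) = max_v P(S = v), returned as an exact fraction."""
    counts = sum_distribution(steps)
    return Fraction(max(counts.values()), 2 ** len(steps))


def erdos_bound(n):
    """binom(n, floor(n/2)) / 2^n, the bound of Erdős for nonzero steps."""
    return Fraction(comb(n, n // 2), 2 ** n)
```

With `steps = [1] * 6` the two quantities coincide, which is the sharpness example mentioned above; for steps that are pairwise distinct the computed value is noticeably smaller.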

The Littlewood-Offord and Erdős results are classic in combinatorics and
have generated an impressive wave of research, in particular from the early
1960s to the late 1980s.

One direction of research was to generalize Erdős' result to other groups.
For example, in 1966 and 1970, Kleitman extended Erdős' result to complex
numbers and normed vectors, respectively.

Another direction was motivated by the observation that (1) can be
improved significantly under additional assumptions on $V$. The first such
result was discovered by Erdős and Moser [2], who showed that if the $v_i$ are
distinct real numbers then $\rho(V) = O(n^{-3/2} \log n)$. They conjectured that
the logarithmic term is not necessary. Sárközy and Szemerédi proved this
conjecture in the 1960s.

Theorem 1.1 (Sárközy-Szemerédi's theorem, [8]). Let $V$ be a set of $n$
distinct integers, then

\[ \rho(V) = O(n^{-3/2}). \]

Unfortunately, the result of Sárközy and Szemerédi does not cover the most
interesting question (also raised by Erdős and Moser [2]): which sets have
optimal concentration probability? This major problem remained open until
the breakthrough of Stanley [9] in the early 1980s.

Theorem 1.2 (Stanley's theorem). Let $n$ be odd and $V_0 := \{-\lfloor n/2 \rfloor, \dots, \lfloor n/2 \rfloor\}$.
Let $V$ be any set of $n$ distinct integers, then

\[ \rho(V) \le \rho(V_0). \]
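
For very small odd $n$ the inequality $\rho(V) \le \rho(V_0)$ can be checked by exhaustive search. The sketch below is a sanity check only, not a proof; the helper names and the choice of search window are our own assumptions. It compares $\rho(V_0)$ with $\rho(V)$ for every $n$-element subset $V$ of a small interval of integers.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations


def rho(steps):
    """Exact concentration probability of the multiset `steps`."""
    counts = Counter({0: 1})
    for v in steps:
        nxt = Counter()
        for s, c in counts.items():
            nxt[s + v] += c
            nxt[s - v] += c
        counts = nxt
    return Fraction(max(counts.values()), 2 ** len(steps))


def stanley_set(n):
    """V_0 = {-floor(n/2), ..., floor(n/2)} for odd n."""
    half = n // 2
    return list(range(-half, half + 1))


def check_stanley(n, window):
    """True if rho(V) <= rho(V_0) for every n-subset V of
    {-window, ..., window} (a finite search window chosen for illustration)."""
    best = rho(stanley_set(n))
    return all(rho(V) <= best
               for V in combinations(range(-window, window + 1), n))
```

For instance, `check_stanley(5, 4)` inspects all 126 five-element subsets of $\{-4, \dots, 4\}$.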

A similar result holds for the case when $n$ is even [9]. Stanley's proof of Theorem
1.2 used sophisticated machinery from algebraic geometry, in particular the
hard Lefschetz theorem. A few years later, a more elementary proof was given
by Proctor [7]. This proof, which has become a classic in algebraic combinatorics,
is also of algebraic nature, involving the representation of the Lie algebra
$sl(2, C)$.

Although Stanley's result has resolved the question of Erdős and Moser, the
algebraic approach has some disadvantages. First, it does not yield the actual
value of $\rho(V_0)$. From Theorem 1.1 one would guess (under the assumption
that the elements of $V$ are different) that $\rho(V_0) = (C_0 + o(1)) n^{-3/2}$ for some
constant $C_0 > 0$. The algebraic proofs do not confirm this estimate.
(Although one can calculate $C_0$ by a basic analytical method.)

Assume for a moment that $C_0$ exists; one would next wonder if $V_0$ is a stable
maximizer. In other words, if some other set $V$ has $\rho(V)$ close to $C_0 n^{-3/2}$,
then should $V$, in some sense, be close to $V_0$? (Notice that $\rho$ is invariant under
dilation.)

The goal of this paper is to address these issues. 

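Theorem 1.3 (Stability theorem). There is a positive constant $\epsilon_0$ such that
for any $0 < \epsilon < \epsilon_0$, there is a positive number $\epsilon' = \epsilon'(\epsilon)$ such that $\epsilon' \to 0$ as
$\epsilon \to 0$, and the following holds: if $V$ is a set of $n$ distinct integers and

\[ \rho(V) \ge \left(\sqrt{24/\pi} - \epsilon\right) n^{-3/2}, \]

then there is an integer $l$ which divides all $v \in V$ and

\[ \sum_{v \in V} (v/l)^2 \le (1 + \epsilon') \sum_{v \in V_0} v^2. \]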


It follows rather explicitly from the proof of Theorem 1.3 that

\[ \rho(V) \le \left(\sqrt{24/\pi} + o(1)\right) n^{-3/2} \tag{2} \]

for any set $V$ of $n$ distinct elements of a torsion-free group. Here we note
that

\[ \rho(V_0) = \left(\sqrt{24/\pi} + o(1)\right) n^{-3/2}. \tag{3} \]
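
The constant $\sqrt{24/\pi} \approx 2.764$ can be observed numerically. The sketch below (our own illustration, with hypothetical function names) computes $\rho(V_0)$ exactly by counting sign patterns and rescales it by $n^{3/2}$; for moderate odd $n$ the rescaled value is already close to $\sqrt{24/\pi}$.

```python
from collections import Counter
from math import pi, sqrt


def rho_stanley(n):
    """rho(V_0) for V_0 = {-floor(n/2), ..., floor(n/2)}, n odd,
    obtained by exact counting of sign patterns."""
    half = n // 2
    counts = Counter({0: 1})
    for v in range(-half, half + 1):
        nxt = Counter()
        for s, c in counts.items():
            nxt[s + v] += c
            nxt[s - v] += c
        counts = nxt
    return max(counts.values()) / 2 ** n


def normalized(n):
    """rho(V_0) * n^(3/2), to be compared with sqrt(24 / pi)."""
    return rho_stanley(n) * n ** 1.5


LIMIT = sqrt(24 / pi)  # conjectured limiting value of normalized(n)
```

Evaluating `normalized(n)` for a few increasing odd values of `n` gives a quick feel for the rate of convergence; the $o(1)$ term is not quantified here.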

As a byproduct, we obtain the first non-algebraic proof of the asymptotic
version of Stanley's theorem, and in a more general way: our elements can
be from any torsion-free group. More importantly, in the case $V \subset Z$, Theorem
1.3 and its proof reveal a natural reason for $V_0$ to be the optimal set: this
is the set (modulo a dilation) that minimizes the variance $\sum_{v \in V} v^2$ of the
random sum $S$.

Our method is motivated by a recent idea of Tao and Vu: in certain problems
on random discrete matrices we need to characterize sets of large
concentration probability (under general assumptions on $V$ and $\eta_i$). The interested
reader may also see [6], [12], [13] for more motivations and ideas lying behind
this paper.

Theorem 1.3 is proven in two steps. In the first step (Section 3) we give a
complete characterization of sets having optimal concentration probability
up to a constant factor. In the second step (Section 5) we exploit this strong
information to verify the stability part.

The first step is to embed the problem into the finite field $F_p$ for some prime
$p$. In the case when the $v_i$ are integers, we simply take $p$ to be a large prime
(for instance, $p \ge 2^n (\sum_{i=1}^n |v_i| + 1)$ suffices). If $V$ is a subset of a general
torsion-free group $G$, one can use Theorem B.4 (see the Appendix).

From now on, we can assume that the $v_i$ are elements of $F_p$ for some large prime
$p$. We view elements of $F_p$ as integers between $0$ and $p - 1$. We use the
shorthand $\rho$ to denote $\rho(V)$.

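Fourier Analysis. The main advantage of working in $F_p$ is that one can
make use of discrete Fourier analysis. Assume that

\[ \rho = \rho(V) = P(S = a) \]

for some $a \in F_p$. Using the standard notation $e_p(x)$ for $\exp(2\pi\sqrt{-1}\,x/p)$, we
have

\[ \rho = P(S = a) = E \frac{1}{p} \sum_{\xi \in F_p} e_p(\xi(S - a)) = E \frac{1}{p} \sum_{\xi \in F_p} e_p(\xi S) e_p(-\xi a). \tag{4} \]

By independence,

\[ E e_p(\xi S) = \prod_{i=1}^n E e_p(\xi \eta_i v_i) = \prod_{i=1}^n \cos\frac{2\pi v_i \xi}{p}. \tag{5} \]

Since $|e_p(-\xi a)| = 1$, it follows that

\[ \rho \le \frac{1}{p} \sum_{\xi \in F_p} \prod_{i=1}^n \left|\cos\frac{2\pi v_i \xi}{p}\right| = \frac{1}{p} \sum_{\xi \in F_p} \prod_{i=1}^n \left|\cos\frac{\pi v_i \xi}{p}\right|. \tag{6} \]

By convexity, we have that $|\sin \pi z| \ge 2\|z\|$ for any $z \in R$, where $\|z\| := \|z\|_{R/Z}$
is the distance of $z$ to the nearest integer. Thus,

\[ \left|\cos\frac{\pi x}{p}\right| \le 1 - \frac{1}{2}\sin^2\frac{\pi x}{p} \le 1 - 2\left\|\frac{x}{p}\right\|^2 \le \exp\left(-2\left\|\frac{x}{p}\right\|^2\right), \tag{7} \]

where in the last inequality we used the fact that $1 - y \le \exp(-y)$ for any
$0 \le y \le 1$.

Consequently, we obtain a key inequality

\[ \rho \le \frac{1}{p} \sum_{\xi \in F_p} \prod_{i=1}^n \left|\cos\frac{\pi v_i \xi}{p}\right| \le \frac{1}{p} \sum_{\xi \in F_p} \exp\left(-2\sum_{i=1}^n \left\|\frac{v_i \xi}{p}\right\|^2\right). \tag{8} \]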
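
The Fourier representation of $P(S = a)$ in $F_p$ and the resulting exponential upper bound can be checked numerically on small examples. The following sketch is an illustration under our own naming; it compares a brute-force count of sign patterns with the sum $\frac{1}{p}\sum_{\xi} e_p(-\xi a)\prod_i \cos(2\pi v_i \xi/p)$, and also evaluates the bound $\frac{1}{p}\sum_{\xi}\exp(-2\sum_i \|v_i\xi/p\|^2)$.

```python
import cmath
from itertools import product
from math import cos, exp, pi


def prob_exact(V, a, p):
    """P(S = a in F_p), by enumerating all 2^n sign patterns."""
    hits = sum(1 for signs in product((-1, 1), repeat=len(V))
               if (sum(s * v for s, v in zip(signs, V)) - a) % p == 0)
    return hits / 2 ** len(V)


def prob_fourier(V, a, p):
    """(1/p) * sum_xi e_p(-xi * a) * prod_i cos(2 pi v_i xi / p)."""
    total = 0.0
    for xi in range(p):
        term = cmath.exp(-2j * pi * xi * a / p)
        for v in V:
            term *= cos(2 * pi * v * xi / p)
        total += term.real  # imaginary parts cancel over xi and -xi
    return total / p


def dist_to_int(z):
    """||z||: the distance from z to the nearest integer."""
    return abs(z - round(z))


def key_upper_bound(V, p):
    """(1/p) * sum_xi exp(-2 * sum_i ||v_i xi / p||^2)."""
    return sum(exp(-2 * sum(dist_to_int(v * xi / p) ** 2 for v in V))
               for xi in range(p)) / p
```

For example, with `V = [1, 2, 3, 4, 5]` and the odd prime `p = 31`, `prob_exact(V, 1, p)` and `prob_fourier(V, 1, p)` agree up to floating-point error, and both are bounded by `key_upper_bound(V, p)`.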

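Large level sets. Now we consider the level sets $S_m := \{\xi : \sum_{i=1}^n \|v_i\xi/p\|^2 \le m\}$.
Decomposing $F_p$ into $\{0\} \cup S_1 \cup (S_2 \setminus S_1) \cup \dots$, we have

\[ n^{-C} \le \rho \le \frac{1}{p} \sum_{\xi \in F_p} \exp\left(-2\sum_{i=1}^n \left\|\frac{v_i\xi}{p}\right\|^2\right) \le \frac{1}{p} + \frac{1}{p} \sum_{m \ge 1} \exp(-2(m-1)) |S_m|. \]

Since $\sum_{m \ge 1} \exp(-m) < 1$, there must be a large level set $S_m$ such that

\[ |S_m| \exp(-m + 2) \ge \rho p. \tag{9} \]

We will use (9) as the starting point for our analysis.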

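For the rest of this section, we show that $S_{n/100}$ can be viewed as some sort
of a dual set of $V$. In fact, one can show that, as far as cardinality is concerned,
it does behave like a dual:

\[ |S_{n/100}| \le \frac{8p}{|V|} = \frac{8p}{n}. \tag{10} \]

To see this, define $T_a := \sum_{v \in V} \cos\frac{2\pi a v}{p}$. Using the fact that $\cos 2\pi z \ge 1 - 50\|z\|^2$
for any $z \in R$, we have, for any $a \in S_{n/100}$,

\[ T_a \ge \sum_{v \in V} \left(1 - 50\left\|\frac{av}{p}\right\|^2\right) \ge \frac{1}{2} n. \]

On the other hand, using the basic identity

\[ \sum_{a \in F_p} \cos\frac{2\pi a x}{p} = \begin{cases} p & \text{if } x = 0, \\ 0 & \text{otherwise,} \end{cases} \]

we have

\[ \sum_{a \in F_p} T_a^2 \le 2pn. \]

(10) follows from the last two estimates and averaging.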
3. A COMPLETE CHARACTERIZATION OF SETS HAVING LARGE
CONCENTRATION PROBABILITY

The goal of this section is to demonstrate the fact that as long as $\rho(V) = \Omega(n^{-3/2})$
in $F_p$, where $p$ is large enough, then most of the elements of $V$
belong to a short interval after an appropriate dilation.

Theorem 3.1 (Characterization of sets having large concentration
probability in $F_p$). Let $\delta$ be a positive constant. Then there are constants $C_1 = C_1(\delta) > 0$
and $C_2 = C_2(\delta) > 0$ such that the following holds. Let $V = \{v_1, \dots, v_n\}$
be a subset of size $n$ of $F_p$, where $p \gg n$ is a large prime,
such that $\rho(V) \ge \delta n^{-3/2}$. Then there exist a number $k = k(p) \in F_p$ and a
partition $V = V_1 \cup V_2 = k \cdot (W_1 \cup W_2)$ with the following properties:

• $|W_1| \le C_1$,

• $\sum_{w \in W_2} w^2 \le C_2 n^3$.

The appearance of $k$ is necessary as $\rho(V)$ is invariant under dilation in $F_p$.
Proof (of Theorem 3.1). By (9) we have

\[ |S_m| \ge \exp(m - 2)\rho p \ge \delta \exp(m - 2) p n^{-3/2} \tag{11} \]

for some $m = O(\log n)$ (it will turn out that $m = O(1)$ later on).

Structure of $S_m$. Consider the sequence of sets $S_m, 2S_m, \dots, 2^l S_m$, where $l$ is the
largest integer such that $4^l m \le n/100$. Let $i_0$ be the largest index
such that $|2^i S_m| \ge 2.1 |2^{i-1} S_m|$ for all $1 \le i \le i_0$ (thus $|2^{i_0} S_m| \ge (2.1)^{i_0} |S_m|$).

By the definition of level sets and the Cauchy-Schwarz inequality, we observe
that $kS_m \subset S_{k^2 m}$ holds for all $k$. In particular,

\[ 2^l S_m \subset S_{4^l m} \subset S_{n/100}. \]
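
The inclusion $S_m + S_m \subset S_{4m}$, a special case of $kS_m \subset S_{k^2 m}$, is easy to observe experimentally. In the sketch below (an illustration with names of our own choosing) the level sets of a small set $V \subset F_p$ are computed directly from the definition and the sumset inclusion is tested.

```python
def dist_to_int(z):
    """||z||: the distance from z to the nearest integer."""
    return abs(z - round(z))


def level_set(V, p, m):
    """S_m = {xi in F_p : sum_i ||v_i xi / p||^2 <= m}."""
    return {xi for xi in range(p)
            if sum(dist_to_int(v * xi / p) ** 2 for v in V) <= m}


def sumset(A, B, p):
    """A + B computed in F_p."""
    return {(a + b) % p for a in A for b in B}
```

With `V = list(range(-5, 6))`, `p = 101` and `m = 0.05`, the set `sumset(S, S, p)` for `S = level_set(V, p, m)` is contained in `level_set(V, p, 4 * m)`; a tiny tolerance on the threshold guards against rounding in borderline cases.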

On the other hand, by the Cauchy-Davenport theorem (see [14, Theorem
5.4]), we have

\[ |2^l S_m| \ge 2^{l - i_0} (|2^{i_0} S_m| - 1). \]

Hence,

\[ |S_{n/100}| \ge |2^l S_m| \ge 2^{l - i_0}(|2^{i_0} S_m| - 1) \ge 2^{l - i_0}((2.1)^{i_0} |S_m| - 1) \ge 2^{l-1} \left(\frac{2.1}{2}\right)^{i_0} |S_m| \ge 2^{l-1} \left(\frac{2.1}{2}\right)^{i_0} \delta \exp(m - 2) p n^{-3/2}. \]

Inserting the upper bound on $|S_{n/100}|$ from (10) and $2^l = \Theta(\sqrt{n/m})$ into the above
inequalities, we derive that $m = O_\delta(1)$ and $i_0 = O_\delta(1)$.

Now we consider the set $X := 2^{i_0} S_m$. As $|X| \le |S_{n/100}| = O(p/n)$ by (10), Theorem B.7
implies that $X$ is Freiman-isomorphic of order $2h$ to a subset $X'$ of the
integers ($h$ is a sufficiently large constant to be chosen).

Next, since $|2X'| = |2X| \le 2.1|X| = 2.1|X'|$, we apply Theorem B.6 and
then Lemma B.9 (since $h$ is large) to obtain an arithmetic progression $P'$ of
rank 1 and of size $|P'| = \Theta(|X'|)$ such that $P' \subset hX'$.

Lifting back to $F_p$, we conclude that $hX$ contains an arithmetic progression
$P$ of rank one and of size $|P| = \Theta(|X|) \gg p n^{-3/2}$ in $F_p$. Since $X$ is symmetric,
we may assume that $P$ is symmetric. Since $hX = h 2^{i_0} S_m \subset S_{h^2 4^{i_0} m}$, we
conclude that, with $M := h^2 4^{i_0} m = O_\delta(1)$,

\[ P \subset hX \subset S_M. \]

We have shown that there exist a constant $C = O_\delta(1)$ and a symmetric
arithmetic progression of rank one with step $k \in F_p$,
$P = \{ik : -Cpn^{-3/2} \le i \le Cpn^{-3/2}\}$, such that the following holds for any $\xi \in P$:

\[ \sum_{v \in V} \left\|\frac{\xi v}{p}\right\|^2 \le M. \]

Set

\[ W := k \cdot V \quad (\text{in } F_p). \]

We then have

\[ \sum_{w \in W} \left\|\frac{jw}{p}\right\|^2 \le M \]

for all $j$ in the range $0 \le j \le Cpn^{-3/2}$.



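Summing over $j$ we get

\[ \sum_{w \in W} \sum_{0 \le j \le Cpn^{-3/2}} \left\|\frac{jw}{p}\right\|^2 \le CMpn^{-3/2}. \tag{12} \]

Let $C'$ be a sufficiently large constant; we set

\[ W_1 := \left\{w \in W : \left\|\frac{w}{p}\right\| \ge \frac{C' n^{3/2}}{p}\right\}. \]

Viewing $F_p$ as $\{0, \dots, p - 1\}$, we observe that if $w \in W_1$ then
$C'n^{3/2} \le w \le p - C'n^{3/2}$. Thus, provided that $CC'$ is sufficiently large and $0 \le j \le Cpn^{-3/2}$,
there are at least $Cpn^{-3/2}/3$ indices $j$ with $jw \in [p/4, 3p/4]$ (in $F_p$). Note that if
this is the case, then $\|jw/p\| \ge 1/4$. Thus we have

\[ \sum_{0 \le j \le Cpn^{-3/2}} \left\|\frac{jw}{p}\right\|^2 \ge \left(\frac{1}{4}\right)^2 Cpn^{-3/2}/3. \]

Summing over $w \in W_1$ we obtain

\[ \sum_{w \in W_1} \sum_{0 \le j \le Cpn^{-3/2}} \left\|\frac{jw}{p}\right\|^2 \ge \frac{C}{48} |W_1| p n^{-3/2}. \]

Together with (12), this implies that

\[ |W_1| \le 48M. \]

Set

\[ W_2 := W \setminus W_1. \]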



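By definition, $\|jw/p\| = j\|w/p\|$ for all $j \le p(2C'n^{3/2})^{-1}$ and for all $w \in W_2$.
Thus

\[ \sum_{0 \le j \le Cpn^{-3/2}} \left\|\frac{jw}{p}\right\|^2 \ge \sum_{0 \le j \le p(2C'n^{3/2})^{-1}} j^2 \left\|\frac{w}{p}\right\|^2 \ge \frac{p^3}{64 C'^3 n^{9/2}} \left\|\frac{w}{p}\right\|^2. \]

Summing over $w \in W_2$ and using (12), we obtain

\[ \frac{p^3}{64 C'^3 n^{9/2}} \sum_{w \in W_2} \left\|\frac{w}{p}\right\|^2 \le \sum_{w \in W} \sum_{0 \le j \le Cpn^{-3/2}} \left\|\frac{jw}{p}\right\|^2 \le CMpn^{-3/2}. \]

Hence,

\[ \sum_{w \in W_2} \left\|\frac{w}{p}\right\|^2 \le 64 C C'^3 M \frac{n^3}{p^2}, \]

concluding the proof.

□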

Remark 3.2. The idea of dividing sumsets into dyadic sequences was first
utilized by Szemerédi and Vu in [10].

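4. AN ESTIMATE ON $\rho(V_0)$

To give the reader an overview of the proof of Theorem 1.3, we show in this
section the estimate (3) for $\rho(V_0)$,

\[ \rho(V_0) = \left(\sqrt{24/\pi} + o(1)\right) n^{-3/2}. \]

Viewing the elements of $F_p$ as integers between $-(p-1)/2$ and $(p-1)/2$, by
(5) we have

\[ \rho(V_0) = \frac{1}{p} \sum_{\xi \in F_p} \prod_{i \in V_0} \cos\frac{2\pi i \xi}{p} = \frac{1}{p} \sum_{\xi \in F_p} \prod_{i \in V_0} \cos\frac{\pi i \xi}{p}. \tag{13} \]

We split this sum into two parts,

\[ \Sigma_1 := \frac{1}{p} \sum_{\|\xi/p\| \le \log^2 n / n^{3/2}} \prod_{i \in V_0} \cos\frac{\pi i \xi}{p}, \qquad \Sigma_2 := \frac{1}{p} \sum_{\|\xi/p\| > \log^2 n / n^{3/2}} \prod_{i \in V_0} \cos\frac{\pi i \xi}{p}. \]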


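We are going to show that

Lemma 4.1.

\[ \Sigma_1 = \frac{1}{p} \sum_{\|\xi/p\| \le \log^2 n / n^{3/2}} \prod_{i \in V_0} \left|\cos\frac{\pi i \xi}{p}\right| = \left(\sqrt{24/\pi} + o(1)\right) n^{-3/2}. \]

Lemma 4.2.

\[ |\Sigma_2| \le \frac{1}{p} \sum_{\|\xi/p\| > \log^2 n / n^{3/2}} \prod_{i \in V_0} \left|\cos\frac{\pi i \xi}{p}\right| \le n^{-3}. \]

The two lemmas together imply that $\rho(V_0) = (\sqrt{24/\pi} + o(1)) n^{-3/2}$.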

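Proof (of Lemma 4.1). The first equality is trivial, as all the cosines are positive
in this range of $\xi$. Viewing $\xi$ as an integer with absolute value at most
$n^{-3/2} p \log^2 n$, we have

\[ \cos\frac{\pi i \xi}{p} = 1 - \left(\frac{1}{2} + o(1)\right) \frac{\pi^2 i^2 \xi^2}{p^2} = \exp\left(-\left(\frac{1}{2} + o(1)\right) \frac{\pi^2 i^2 \xi^2}{p^2}\right). \]

Since $\sum_{i \in V_0} i^2 = (1 + o(1)) \frac{n^3}{12}$, it follows that

\[ \Sigma_1 = (1 + o(1)) \int_{|x| \le \log^2 n / n^{3/2}} \exp\left(-\left(\frac{1}{2} + o(1)\right) \frac{\pi^2 n^3}{12} x^2\right) dx. \]

Setting $y = \sqrt{\pi^2 n^3 / 12}\; x$, changing variables, and using the gaussian identity
$\frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \exp(-y^2/2)\, dy = 1$, we have

\[ \Sigma_1 = (1 + o(1)) \left(\frac{\pi^2 n^3}{12}\right)^{-1/2} \int_{-\infty}^{\infty} \exp\left(-\frac{y^2}{2}\right) dy = \left(\sqrt{24/\pi} + o(1)\right) n^{-3/2}, \]

completing the proof. □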
Proof (of Lemma 4.2). To prove Lemma 4.2 we use the upper bound from
(7). We split the sum into three subsums, according to the magnitude of $\|\xi/p\|$.
We make frequent use of the simple fact that if $|i|\,\|\xi/p\| \le 1/2$ then

\[ \left\|\frac{i\xi}{p}\right\| = |i| \left\|\frac{\xi}{p}\right\|. \]

Small $\|\xi/p\|$: $\frac{\log^2 n}{n^{3/2}} \le \|\xi/p\| \le \frac{1}{n}$. For all $i$ with $|i| \le n/2$, we have $\|i\xi/p\| = |i|\,\|\xi/p\|$, so

\[ \sum_{i \in V_0} \left\|\frac{i\xi}{p}\right\|^2 = \sum_{i \in V_0} i^2 \left\|\frac{\xi}{p}\right\|^2 \ge \frac{n^3}{12} \cdot \frac{\log^4 n}{n^3} \ge 4 \log n. \tag{14} \]

Thus, the contribution of this subsum is bounded from above by $n^{-4}$.
Medium |||||: ^ < ||||| < \. By Cauchy-Schwartz, we have 



\[ \left\|\frac{i\xi}{p}\right\|^2 + \left\|\frac{i'\xi}{p}\right\|^2 \ge \frac{1}{2} \left\|\frac{(i - i')\xi}{p}\right\|^2. \]

For any $\xi$ in this case, let $W(\xi)$ be the set of pairwise disjoint pairs $(i, i')$ of elements of $V_0$
that maximizes the sum $M(\xi) := \sum_{(i,i') \in W(\xi)} |i - i'|^2$ under the constraint
that $|i - i'|\,\|\xi/p\| \le 1/2$ for all $(i, i') \in W(\xi)$. It is easy to check that $M(\xi) \ge n^c$
for some constant $c > 0$ for all $\xi$ in this case. (For more details see Lemma
5.3.) From here one can conclude that the contribution of this subsum is at
most $\exp(-n^{\Omega(1)}) = o(n^{-4})$.

Large |||[|: | < [||[| < |. By Weyl's equidistribution theorem, the number 

of i G Vq such that ||^|| > | is approximately n/2. Thus, for any £ in this 
category 



E »f ii 2 >- 1- <*> 

Hence, the contribution of this subsum is only $\exp(-\Omega(n))$.

□ 

5. PROOF OF THEOREM 1.3

First, by applying Theorem 3.1, there exist $k \in F_p$ and a partition
$V = k \cdot (W_1 \cup W_2)$ such that $|W_1| \le C_1$ and $\sum_{w \in W_2} w^2 \le C_2 n^3$, where $C_1$ and
$C_2$ are positive constants depending on $\epsilon$.

Let $C = C(\epsilon)$ be a large positive constant and $c = c(\epsilon)$ be a small positive
constant. By setting $C, c$ properly and moving at most $\epsilon n$ elements of $W_2$
to $W_1$, we can assume the following property for $W_2$.

Property 5.1. $W_2$ is a $c$-irreducible subset of size at least $(1 - \epsilon)n$ of the
interval $[-Cn, Cn]$. In other words, there is no integer $d \ge 2$ which divides
all but $c|W_2|$ elements of $W_2$.
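
The notion of $c$-irreducibility is simple to test by brute force on small examples. The sketch below is illustrative only: the function name is ours, and we read "divides all but $c|W_2|$ elements" as "fails to divide at most $c|W_2|$ elements", which is an assumption about the intended convention.

```python
def is_c_irreducible(W, c):
    """True if no integer d >= 2 divides all but at most c * |W|
    elements of the integer collection W (assumed reading of Property 5.1)."""
    W = list(W)
    allowed = c * len(W)
    # Any d larger than max |w| divides only the zero elements of W.
    nonzero = sum(1 for w in W if w != 0)
    if nonzero <= allowed:
        return False
    bound = max(abs(w) for w in W)
    for d in range(2, bound + 1):
        not_divisible = sum(1 for w in W if w % d != 0)
        if not_divisible <= allowed:
            return False
    return True
```

For example, the interval $\{-10, \dots, 10\}$ is $c$-irreducible for $c = 0.1$, while its dilate by $2$ is not, since $d = 2$ divides every element.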

Indeed, after applying Theorem 3.1, because $\sum_{w \in W_2} w^2 \le C_2 n^3$, all but
$\epsilon n/2$ elements of $W_2$ belong to the interval $[-Cn, Cn]$, provided that $C$ was
chosen to be large enough.

To ensure the irreducibility property, we iterate the following process. First,
if $W_2$, viewed as a subset of integers, can be written as $W_2 = d_2 \cdot W_2' \cup X_2$
with some integer $d_2 \ge 2$ and $|W_2'| \ge |W_2| - c|W_2|$, then we move these
exceptional $c|W_2|$ elements to $W_1$ and pass to consider $W_2'$, which is a subset
of $[-Cn/2, Cn/2]$. Next, if $W_2' = d_2' \cdot W_2'' \cup X_2'$ with some integer $d_2' \ge 2$
and $|W_2''| \ge |W_2'| - c|W_2'|$, then we move these $c|W_2'|$ exceptional elements
to $W_1$ and switch to consider $W_2''$, which now belongs to $[-Cn/4, Cn/4]$,
and keep iterating the process. Observe that whenever a reduction occurs,
the size of the new set drops by at most $cn$, while the size of the
interval containing it shrinks by a factor of 2. Thus if $c$ was chosen to be
smaller than $\epsilon(4(3 + \log_2 C))^{-1}$, then the process must terminate after at
most $3 + \log_2 C$ steps, as the set under consideration would then have size
at least $n - C_1 - (3 + \log_2 C)cn \ge n - \epsilon n/2$, but the interval containing it
would be $[-n/8, n/8]$, a contradiction.

Thus, by applying Theorem 3.1, there exist $k \in F_p$ and a partition
$V = k \cdot (W_1 \cup W_2) \pmod p$ such that

• $|W_1| \le \epsilon n$,

• $W_2$ satisfies Property 5.1.

In the next step, we pass to consider the concentration probability of W by 
the following identity 




We split the sum into two parts, 
E2:= - E e p(-y) II cos -^- 

II I II ^ lo s 2 " w&W 

Hp 11^ „3/2 



We are going to exploit the structure of $W_2$ to show that

Lemma 5.2.

\[ |\Sigma_2| \le \frac{1}{p} \sum_{\|\xi/p\| > \log^2 n / n^{3/2}} \prod_{w \in W_2} \left|\cos\frac{\pi w \xi}{p}\right| \le n^{-3}. \]

Proof. Making use of (7), we split the sum into three subsums, according to
the magnitude of $\|\xi/p\|$.

Small $\|\xi/p\|$: $\frac{\log^2 n}{n^{3/2}} \le \|\xi/p\| \le \frac{1}{2Cn}$. Because $W_2 \subset [-Cn, Cn]$, we easily obtain
that

\[ \sum_{w \in W_2} \left\|\frac{w\xi}{p}\right\|^2 = \sum_{w \in W_2} w^2 \left\|\frac{\xi}{p}\right\|^2 \ge \frac{(1 - \epsilon)^3 n^3}{12} \cdot \frac{\log^4 n}{n^3} \ge 4 \log n. \]

Thus the contribution from this part is bounded from above by $n^{-4}$.

Medium $\|\xi/p\|$: $\frac{1}{2Cn} \le \|\xi/p\| \le \frac{1}{64C}$. To handle this part, we first observe the
following simple fact.

Lemma 5.3. Let $a \in F_p$ be arbitrary. Let $\xi \in F_p$ and $l > 0$ be such that
$l\|\xi/p\| \le 1/2$. Then the following holds for any sequence $0 \le i_1 < \dots < i_m \le l$
with $m \ge 4$:

\[ \sum_{j=1}^m \left\|\frac{a + i_j \xi}{p}\right\|^2 \ge \frac{m^3}{48} \left\|\frac{\xi}{p}\right\|^2. \tag{16} \]
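
An inequality of the form $\sum_{j=1}^m \|(a + i_j\xi)/p\|^2 \ge \frac{m^3}{48}\|\xi/p\|^2$, under the hypotheses $l\|\xi/p\| \le 1/2$, $0 \le i_1 < \dots < i_m \le l$ and $m \ge 4$, can be probed numerically. The sketch below is a randomized sanity check with hypothetical helper names, not a substitute for the proof; the sampling scheme and the tolerance are our own choices.

```python
import random


def dist_to_int(z):
    """||z||: the distance from z to the nearest integer."""
    return abs(z - round(z))


def lemma_5_3_sides(a, xi, p, indices):
    """Return (lhs, rhs) with lhs = sum_j ||(a + i_j xi)/p||^2 and
    rhs = m^3 / 48 * ||xi/p||^2 for the index sequence `indices`."""
    m = len(indices)
    lhs = sum(dist_to_int((a + i * xi) / p) ** 2 for i in indices)
    rhs = m ** 3 / 48 * dist_to_int(xi / p) ** 2
    return lhs, rhs


def random_instance(rng, p):
    """Draw (a, xi, indices) with l * ||xi/p|| <= 1/2 and m >= 4."""
    xi = rng.randrange(1, p)
    l = int(0.5 / dist_to_int(xi / p))
    while l < 4:  # need room for at least four indices; redraw xi
        xi = rng.randrange(1, p)
        l = int(0.5 / dist_to_int(xi / p))
    m = rng.randint(4, min(l + 1, 40))
    indices = sorted(rng.sample(range(0, l + 1), m))
    return rng.randrange(p), xi, indices


def check_lemma_5_3(trials, p, seed=0):
    """True if lhs >= rhs (up to rounding) on `trials` random instances."""
    rng = random.Random(seed)
    for _ in range(trials):
        a, xi, indices = random_instance(rng, p)
        lhs, rhs = lemma_5_3_sides(a, xi, p, indices)
        if lhs < rhs - 1e-12:
            return False
    return True
```

Calling `check_lemma_5_3(300, 10007)` runs the check on 300 pseudo-random instances modulo the prime 10007.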

The proof of Lemma 5.3 is quite simple; we defer it to Appendix A. Now
we arrange the elements of $W_2$ as

\[ -Cn \le w_1 < w_2 < \dots < w_{|W_2|} \le Cn. \]

Set $l := \frac{1}{2\|\xi/p\|}$. Thus $32C \le l \le Cn$. Let $i_1$ be the largest index such that
$w_{i_1} - w_1 \le l$. We then move on to choose $i_2 > i_1$, the largest index such that
$w_{i_2} - w_{i_1 + 1} \le l$, and so on. By doing so, we create several blocks of elements of
$W_2$ with the property that elements of the same block have difference at most $l$.
Since $w_{i_{j+1}+1} - w_{i_j+1} > l$ for all $j$, the number of blocks is less than $\frac{2Cn}{l} + 1$.
Next, we call a block short if it contains no more than ^ elements of W%. 
The total number of elements of W 2 that belong to short blocks is bounded 
by _|_ < Mai. Hence there are at least ^ elements that belong 

to long blocks. 

For simplicity, we divide each long block into smaller blocks of exactly Ls^jJ 
elements. The number of such uniform blocks is then at least ^ ^\j%c = 
> Qn We apply ^ tQ each block ( with m = [i/8C\), and then 
sum over the collection of all blocks using the fact that / = wh^n ; 



> II II > — ; 777 H > ' n IHI > n - 

' p I 48 p p 

Thus the contribution from this part is bounded from above by $\exp(-\Omega(n))$.
Large $\|\xi/p\|$: $\frac{1}{64C} \le \|\xi/p\| \le \frac{1}{2}$. Let $\delta := \frac{1}{\log n}$ and

\[ W_2' := \left\{w \in W_2 : \left\|\frac{w\xi}{p}\right\| \le \delta\right\}. \]

Assume for the moment that $|W_2'| \le (1 - c)|W_2|$ for some positive constant
$c = c(\epsilon)$. Then

\[ \sum_{w \in W_2} \left\|\frac{w\xi}{p}\right\|^2 \ge c|W_2|\delta^2 \gg \delta^2 n \gg \frac{n}{\log^2 n}, \]

and so the contribution from this part is bounded from above by $\exp(-\Omega(n/\log^2 n))$.

Thus, it suffices to show that $|W_2'| \le (1 - c)|W_2|$ for some sufficiently small
$c$. Assume otherwise; we will deduce that there exists a nontrivial $d \in Z$
that divides all the elements of $W_2'$, which contradicts the $c$-irreducibility
assumption on $W_2$.

To obtain the above contradiction we first apply Corollary B.10 to $W_2'$. It
implies that $kW_2' - kW_2'$ contains an arithmetic progression of rank
one, $Q = \{id : |i| \le \gamma n\}$.

Because $Q \subset [-2kCn, 2kCn]$, the step $d$ must be bounded,

\[ 0 < d \le \frac{2kC}{\gamma}. \tag{17} \]



A NEW APPROACH TO AN OLD PROBLEM OF ERDOS AND MOSER 






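Let $q$ be an element of $Q$. By definition, we can write $q$ as a sum and
difference of elements of $W_2'$,

\[ q = w_1 + \dots + w_k - w_{k+1} - \dots - w_{2k}. \]

Because $\|w_i\xi/p\| \le \delta$ for all $i$, by the triangle inequality we have

\[ \left\|\frac{q\xi}{p}\right\| \le 2k\delta. \]

Applying the above estimate to all elements of $Q$, one obtains

\[ \left\|\frac{id\xi}{p}\right\| \le 2k\delta \quad \text{for all } |i| \le \gamma n. \]

But as $2k\delta < 1/4$, one must have

\[ \left\|\frac{d\xi}{p}\right\| \le \frac{2k\delta}{\gamma n}. \tag{18} \]

Next, view $F_p$ as the interval $[-(p-1)/2, (p-1)/2]$ of $Z$ and consider $\xi$ as
an integer which satisfies $\frac{p}{64C} \le |\xi| \le \frac{p}{2}$.

By the above bound on $\|d\xi/p\|$, we can write $d\xi = sp + t$, and so

\[ \xi = \frac{sp + t}{d}, \]

where $s, t$ are integers with $|t| \le \frac{2k\delta p}{\gamma n}$.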

We now collect some useful information about $s$ and $d$. Firstly, because $\xi$
has large absolute value, $|\xi| \ge \frac{p}{64C}$, $s$ cannot be zero. Secondly, because
$|\xi| \le p/2$ and $t$ is small, $d$ does not divide $s$.

Let $w'$ be an arbitrary element of $W_2'$ and consider in $Z$ the product $w'\xi$,

\[ w'\xi = \frac{w'(sp + t)}{d} = s'p + \frac{t'p}{d} + \frac{w't}{d}, \]

where $w's = s'd + t'$ with $s', t' \in Z$, and $-\frac{d}{2} \le t' \le \frac{d}{2}$.
Because $|w'| \le Cn$, we have

\[ \left|\frac{w't}{d}\right| \le Cn \cdot \frac{2k\delta p}{\gamma n d} = \frac{2Ck\delta p}{\gamma d} \le \frac{p}{2d}, \]

where in the last inequality we used the fact that $\delta$ is small compared to all
other quantities.

We next consider two cases according to the value of $t'$. We will eventually
run into a contradiction in both cases, thus completing the proof of Lemma 5.2.

Case 1: $t' \ne 0$. Because $-\frac{d}{2} \le t' \le \frac{d}{2}$ we have $\frac{p}{d} \le \left|\frac{t'p}{d}\right| \le \frac{p}{2}$, and so

\[ \frac{p}{2d} \le \left|\frac{t'p}{d} + \frac{w't}{d}\right| \le \frac{p}{2} + \frac{p}{2d}. \]

Together with (18), this implies that $\|w'\xi/p\| \ge \frac{1}{2d}$. Hence, by the bound on $d$
from (17),

\[ \left\|\frac{w'\xi}{p}\right\| \ge \frac{\gamma}{4kC} > \delta. \]

This inequality violates the definition of $W_2'$, so Case 1 does not hold as long
as $\delta$ is sufficiently small.

Case 2: $t' = 0$. It then follows that $d$ divides $sw'$ for all $w' \in W_2'$. But
recall that $d$ does not divide $s$, so all the elements of $W_2'$ are divisible by a
nontrivial divisor of $d$. However, this contradicts the $c$-irreducibility
assumption on $W_2$.

□ 



We have shown that the contribution of $\Sigma_2$ is negligible. Thus, it suffices to
justify Theorem 1.3 from the following assumption



El > (1 - e^-n-, = (1 - e)^ «,<-> 



(19) 



We have 
£ *<-?>II 



t r ttw£ 

— cos 

p ^ ^ 2 ; 11 p 

ii j log 2 n uiew 

< - > cos cos 

i f ' J X. X. i f X J- 

II £ ||^ log 2 n WgWj WGW 2 
Up Hi „3/2 

II g II ^ log 2 n V WGVFl / 

"p"-„3/2 



where 



M(£):= [J cos— ^=exp -(l/2 + o(l)) 



7TU;^ / ^ TT 2 W 2 £ 2 



P \ ' " ^— ' p 2 

wew 2 \ wew 2 

Combining this estimate and the lower bound for £1 we obtain that 



E- 2 ^( 1 + e )^- ( 2 °) 

wew 2 

What remains is to show that a similar sum involving the elements of $W_1$ is
negligible. To this end, we first show that the elements of $W_1$ do not have
large absolute values (viewing $F_p$ as the interval $[-(p-1)/2, (p-1)/2]$ of
$Z$).



Let C = C(e) be a number so that 



/ eM-^dy = e. (21) 
J\v\>C 2 

As p and n are large and \W2\ > (1 — e)n, we have 

1 E fwM < ( E ^ w2 r 1 ' 2 f *M- y 4)dy 



<H ii|<i£aln w£W 2 

n 3/2 —lip II- T372 



/ 12 / 12 Z" 00 y 2 

<( 1+4e )-V^ e - e v^y_ 00 exp( -y )dy ' 



It thus follows from (|19p that 
(22)



lie n< c \ weWi 



Now we are ready to state our estimate on the elements of $W_1$.

Lemma 5.4. View $F_p$ as the interval $[-(p-1)/2, (p-1)/2]$ of $Z$. Let $w_0$
be an arbitrary element of $W_1$. Then,



3 

Proof, (of Lemma l5.4p Assume for contradiction that \wq\ > V^j. Then the 
number of £ G [- J^, 4^] satisfying § < ||^|| is at least (3/4- o(l))|^. 
Denote this set by 7, then for £ G 7 we have 



II — II 2 > ||WqC,, 2 > J_ 

„, P ~~ P ~ 64' 

Notice that the function fw 2 (0 is decreasing in |£| (in the range —-^rpi — 
£ < and |7| > Cp/n 3 / 2 , we have 



si < -E ex p(-ii-n 2 )M(o + - E «p(-ii— f)M(0 
p p p p 

^E ex p(-^)M(£) + ^ E mco 

4 E M(e)-(i-ex P (--^ e m(0- 

||£||< c _c <i|i|K_c 

As p and n are large, the above sum is bounded by 



Si < (1 + 4e) W-^ ( / ex P("T) - - exp ("7il)) / exp (" 
V vr^n^ ^7[ V [<C7 2 64 Jc/2<\y\<C 

^ (1 + 4£) /3£ (v 7 ^ - (1 - - P (-^)) e 2/3 



2 



where in the last estimate we used the definition (|2ip of C and the fact that 
/ «p(-£)e& > 

JC/2<\v\<C 1 



>C/2<\y\<C 

By choosing $\epsilon$ sufficiently small, the upper bound for $\Sigma_1$ above is smaller
than the lower bound in (19), a contradiction. This concludes the proof of
Lemma 5.4. □



To continue, set 



i- E 

w&Wi 



w 2 . 



By Lemma 15.41 one has \w\ < ^jn 3 ^ 2 for every w E W\. Thus if ||||| < 
Cn~ 3 / 2 , then 



l™£||2 \^ I |2||C|,2 ||C|,2 



e e» 2 = e 



p « — ' p p 

weWi ^ w&Wi L F 

This observation enables us to apply the proof of Lemma 5.4 with $w_0$
replaced by $\sigma_1$, to obtain a similar bound for $\sigma_1$



n 3 



= E - 2 ^ (23) 



Putting (20) and (23) together, we obtain

\[ \sum_{w \in W_1} w^2 + \sum_{w \in W_2} w^2 \le (1 + \epsilon') \frac{n^3}{12}, \tag{24} \]

where $\epsilon' \to 0$ as $\epsilon \to 0$.

Let us stop to record what we have achieved so far. For any sufficiently
large $p$, there exist a nonzero number $k \in F_p$ and a set $W$ of integers which
satisfies (24) and such that $V = k \cdot W \pmod p$.

On the other hand, note that the number of sets $W$ which satisfy (24)
is bounded by a function of $n$ and $\epsilon'$. Hence, there exists a common set
$W = \{w_1, \dots, w_n\}$ for which $v_i = k w_i \pmod p$, $v_i \in V$, for infinitely many
primes $p$.
As a consequence, for any pair $i, j$ one has $p \mid k(v_i w_j - v_j w_i)$, or equivalently

\[ p \mid v_i w_j - v_j w_i. \]

Because this is true for infinitely many $p$, one has

\[ v_i w_j = v_j w_i \quad (\text{in } Z). \]

As this identity holds for every pair $i, j$ and as $W$ is irreducible (because
of either (24) or Property 5.1), there exists a nonzero integer $l$ such that

\[ V = l \cdot W. \]

In other words, $l$ divides every element of $V$ and $\sum_{v \in V} (v/l)^2 = \sum_{w \in W} w^2$
satisfies (24), completing the proof of Theorem 1.3.

Acknowledgment. The author would like to thank Van Vu for very help-
tation of the paper.

Appendix A. Proof of Lemma 5.3

Without loss of generality, we assume that $m$ is even. For each $j \le m/2$, by
the Cauchy-Schwarz inequality and the triangle inequality, we have

\[ \left\|\frac{a + i_j\xi}{p}\right\|^2 + \left\|\frac{a + i_{m-j}\xi}{p}\right\|^2 \ge \frac{1}{2} \left\|\frac{(i_{m-j} - i_j)\xi}{p}\right\|^2 = \frac{1}{2} \left((i_{m-j} - i_j) \left\|\frac{\xi}{p}\right\|\right)^2. \]

Summing over $1 \le j \le m/2$, we obtain

\[ \sum_{j=1}^m \left\|\frac{a + i_j\xi}{p}\right\|^2 \ge \frac{1}{2} \sum_{1 \le j \le m/2} \left((i_{m-j} - i_j) \left\|\frac{\xi}{p}\right\|\right)^2 \ge \frac{1}{2} \sum_{1 \le j \le m/2} j^2 \left\|\frac{\xi}{p}\right\|^2 \ge \frac{m^3}{48} \left\|\frac{\xi}{p}\right\|^2, \]

where in the second inequality we used the fact that $i_{m-j} - i_j$ strictly
decreases in $j$ (as a sequence of integers).

Appendix B. Tools from Additive Combinatorics

B.1. Freiman homomorphism. We now introduce the concept of a Freiman
homomorphism, which allows us to transfer an additive problem in one group
$G$ to another group $G'$ in a way which is more flexible than the usual
algebraic notion of group homomorphism.

Definition B.2 (Freiman homomorphisms). Let $k \ge 1$, and let $X$, $Y$ be
additive sets of groups $G$ and $H$ respectively. A Freiman homomorphism of
order $k$ from $X$ to $Y$ is a map $\phi : X \to Y$ with the property that

\[ x_1 + \dots + x_k = x_1' + \dots + x_k' \implies \phi(x_1) + \dots + \phi(x_k) = \phi(x_1') + \dots + \phi(x_k') \]

for all $x_1, \dots, x_k; x_1', \dots, x_k' \in X$. If in addition there is an inverse map $\phi^{-1}$ from
$Y$ to $X$ which is a Freiman homomorphism of order $k$, then we say that $\phi$ is a
Freiman isomorphism of order $k$, and that $X$ and $Y$ are Freiman isomorphic
of order $k$.
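
For small finite sets, the defining property of a Freiman homomorphism can be verified by brute force. The sketch below is an illustration with our own function names; the group operations are passed in as plain functions, and the map $\phi$ is represented as a dictionary defined on $X$. It groups all $k$-fold sums in $X$ by their value and checks that equal sums have equal images.

```python
from itertools import combinations_with_replacement


def is_freiman_hom(phi, X, k, add_src, add_dst):
    """Check that x_1+...+x_k = x'_1+...+x'_k implies
    phi(x_1)+...+phi(x_k) = phi(x'_1)+...+phi(x'_k) on X."""
    image_of_sum = {}
    for tup in combinations_with_replacement(sorted(X), k):
        s = t = None
        for x in tup:
            s = x if s is None else add_src(s, x)
            t = phi[x] if t is None else add_dst(t, phi[x])
        # All k-tuples with the same source sum must share one image sum.
        if image_of_sum.setdefault(s, t) != t:
            return False
    return True


def is_freiman_iso(phi, X, k, add_src, add_dst):
    """Check that phi is injective on X and that both phi and its
    inverse are Freiman homomorphisms of order k."""
    inverse = {y: x for x, y in phi.items()}
    if len(inverse) != len(phi):
        return False  # phi is not injective
    return (is_freiman_hom(phi, X, k, add_src, add_dst)
            and is_freiman_hom(inverse, set(inverse), k, add_dst, add_src))
```

As a usage example, reduction modulo 17 restricted to $\{0, 1, 2, 3\} \subset Z$ is a Freiman isomorphism of order 2 onto its image in $F_{17}$, whereas on $\{0, \dots, 9\}$ it is still a homomorphism but no longer an isomorphism, because $9 + 9 = 0 + 1$ in $F_{17}$.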

Clearly Freiman homomorphisms preserve the property of being a
progression. We now mention a result that shows torsion-free additive groups are
not richer than the integers, for the purposes of understanding sums and
differences of finite sets ([14, Chapter 5]).

Theorem B.3. Let $X$ be a finite subset of a torsion-free additive group $G$.
Then for any integer $k$, there is a Freiman isomorphism $\phi : X \to \phi(X)$ of
order $k$ to some finite subset $\phi(X)$ of the integers $Z$. The same is true if we
replace $Z$ by $F_p$, if $p$ is sufficiently large depending on $X$.

An identical proof also yields a somewhat stronger result below.

Theorem B.4. Let $X$ be a finite subset of a torsion-free additive group $G$.
Then for any integer $k$, there is a map $\phi : X \to \phi(X)$ to some finite subset
$\phi(X)$ of the integers $Z$ such that

\[ x_1 + \dots + x_i = x_1' + \dots + x_j' \iff \phi(x_1) + \dots + \phi(x_i) = \phi(x_1') + \dots + \phi(x_j') \]

for all $i, j \le k$. The same is true if we replace $Z$ by $F_p$, if $p$ is sufficiently
large depending on $X$.

B.5. Freiman's inverse theorem. A subset $Q$ of an abelian group is a
generalized arithmetic progression (GAP) of rank $r$ if it can be expressed
in the form

\[ Q = \{a_0 + x_1 a_1 + \dots + x_r a_r \mid M_i \le x_i \le M_i' \text{ for all } 1 \le i \le r\} \]

for some $a_0, \dots, a_r$ and $M_1, \dots, M_r, M_1', \dots, M_r'$.

It is convenient to think of $Q$ as the image of an integer box
$B := \{(x_1, \dots, x_r) \in Z^r \mid M_i \le x_i \le M_i'\}$ under the linear map

\[ \Phi : (x_1, \dots, x_r) \mapsto a_0 + x_1 a_1 + \dots + x_r a_r. \]

The numbers $a_i$ are the generators of $Q$, the numbers $M_i, M_i'$ are the
dimensions of $Q$, and $Vol(Q) := |B|$ is the volume of $Q$. We say that $Q$ is
proper if this map is one to one, or equivalently if $|Q| = Vol(Q)$. For
non-proper GAPs, we of course have $|Q| < Vol(Q)$. If $-M_i = M_i'$ for all $i \ge 1$
and $a_0 = 0$, we say that $Q$ is symmetric.
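
The definitions of a GAP, its volume, and properness translate directly into a few lines of code for integer generators. The sketch below is only an illustration (function names and argument conventions are ours): it enumerates the image of the integer box $B$ under $\Phi$ and compares the number of distinct values with the volume.

```python
from itertools import product


def gap_elements(a0, generators, lows, highs):
    """All values a0 + sum_i x_i * a_i with lows[i] <= x_i <= highs[i],
    listed with repetition (one value per point of the box B)."""
    boxes = [range(lo, hi + 1) for lo, hi in zip(lows, highs)]
    return [a0 + sum(x * a for x, a in zip(xs, generators))
            for xs in product(*boxes)]


def volume(lows, highs):
    """Vol(Q) = |B|, the number of integer points in the box."""
    vol = 1
    for lo, hi in zip(lows, highs):
        vol *= hi - lo + 1
    return vol


def is_proper(a0, generators, lows, highs):
    """True if the map Phi is one to one on the box, i.e. |Q| = Vol(Q)."""
    values = gap_elements(a0, generators, lows, highs)
    return len(set(values)) == len(values)
```

For example, the symmetric GAP with generators $1$ and $10$ and dimensions $[-2, 2]$ in each coordinate is proper with volume 25, while replacing the generators by $1$ and $2$ gives a non-proper GAP with only 13 distinct elements.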

It is easy to verify that if $X$ is a dense subset of a GAP of bounded rank, then
$X + X$ has size not much bigger than $X$. The celebrated Freiman's inverse
theorem says the converse. This theorem comes in a number of variants; we
give one of them below.

Theorem B.6 (Freiman's inverse theorem). [14, Theorem 5.35] Let $\gamma \ge 2$
be a given positive number. Let $X$ be a subset of a torsion-free group such
that $|X + X| \le \gamma|X|$. Then for any $\epsilon > 0$ there exists a proper symmetric
arithmetic progression $Q$ of rank at most $\lfloor \log_2 \gamma + \epsilon \rfloor$ and size $|Q| = \Theta_{\epsilon,\gamma}(|X|)$
such that $|X \cap Q| = \Theta_{\epsilon,\gamma}(|X|)$.

In our setting, after passing to $F_p$ by a Freiman isomorphism, in order to
apply Theorem B.6 we need to pass back to torsion-free groups. The
following result by Green and Ruzsa allows us to do so.

Theorem B.7. [3, Theorem 1.3] Let $X \subset F_p$ be a set with $|2X| \le \gamma|X|$
and $|X| = O_{k,\gamma}(p)$. Then $X$ is Freiman isomorphic of order $k$ to a subset
of a torsion-free group.

B.8. Sárközy-type theorem in GAP. Assume that $X$ is a dense subset
of a GAP $Q$; then the iterated sumset $kX$ contains a structure similar to
$Q$ (see [10, Lemma 4.4, Lemma 5.5], [11, Lemma B3]).

Lemma B.9 (Sárközy-type theorem in progressions). Let $Q$ be a proper
GAP of rank $r$ in a torsion-free group. Let $X \subset Q$ be a subset such that
$|X| \ge \delta|Q|$ for some $0 < \delta < 1$. Then there exists a positive integer
$1 \le k \ll_{\delta, r} 1$ such that $kX$ contains a GAP $Q'$ of rank $r$ and size $\Theta_{\delta,r}(|Q|)$.
Furthermore, the generators of $Q'$ are bounded multiples of the generators
of $Q$. If $Q$ and $X$ are symmetric, then $Q'$ can be chosen to be symmetric.

An easy corollary when $Q$ has rank 1 follows.

Corollary B.10. Assume that $X$ is a subset of $[-Cn, Cn]$ of size $n$ in $Z$.
Then there exist an integer $k = k(C)$ and a positive number $\gamma = \gamma(C) > 0$
such that $kX - kX$ contains a symmetric arithmetic progression of rank 1
and length $2\gamma n + 1$.